Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
29793 | 31 | 100,000 |
32836 | 27 | 10,000 |
41761 | 19 | 2,000 |
41762 | 19 | 20,000 |
41770 | 19 | 500,000 |
43292 | 18 | 250,000 |
48872 | 15 | 15,000 |
48902 | 15 | 50,000 |
51215 | 14 | 5,000 |
53750 | 13 | 1,5 |

Words containing (:

Rank in Wordlist | Frequency | Word |
---|---|---|
40700 | 20 | الله(صلى |
44361 | 18 | لـ(الجزيرة |
56656 | 13 | (أ |
62490 | 11 | لـ(النادي |
62990 | 11 | و(2 |
71408 | 9 | ل(الجزيرة |
78482 | 8 | و(4 |
80137 | 7 | A(H1N1 |
84295 | 7 | على(400 |
85988 | 7 | و(12 |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
20938 | 50 | 2011)، |
24763 | 40 | %) |
32997 | 27 | السعودية)، |
37824 | 22 | %). |
37833 | 22 | 2)، |
40317 | 20 | 1)، |
44212 | 18 | عاماً)، |
45126 | 17 | إلخ)، |
46861 | 16 | أ)، |
47837 | 16 | ريال)، |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
3860 | 389 | 50% |
5435 | 271 | 10% |
5586 | 263 | %. |
5766 | 254 | 20% |
7515 | 185 | 25% |
7516 | 185 | 30% |
7669 | 181 | 80% |
7809 | 177 | 5% |
7966 | 173 | 60% |
8755 | 155 | 40% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
63937 | 10 | AT&T |
112992 | 4 | C&C |
134877 | 3 | S&P |
256414 | 1 | ALT&AST |
256506 | 1 | AT&T، |
257517 | 1 | C&C، |
260856 | 1 | K&N |
260892 | 1 | KFSH&RC |
261792 | 1 | MfT&Jsmafe1ad |
261859 | 1 | MoIT&T |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
17638 | 63 | $0 |
98172 | 5 | $، |
112182 | 4 | $)، |
112183 | 4 | $0علم |
112586 | 4 | 20$ |
132919 | 3 | $) |
132920 | 3 | $). |
132921 | 3 | $0وقد |
133924 | 3 | 50$ |
167036 | 2 | $04- |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
239349 | 1 | $" |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
94341 | 6 | لـ''الاقتصادية |
100396 | 5 | الاقتصادية''، |
134766 | 3 | O'Dea |
154693 | 3 | لـ''سمه |
168870 | 2 | 33''، |
171171 | 2 | Microsoft's |
171271 | 2 | O'Reilly |
172013 | 2 | Who's |
172032 | 2 | Women's |
172621 | 2 | x',y |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
241458 | 1 | 104+24(IV))بت |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
3886 | 387 | الدكتور/ |
6445 | 222 | الأستاذ/ |
10278 | 127 | و/أو |
12374 | 100 | الشيخ/ |
12933 | 95 | ج/ |
13042 | 94 | د/ |
13342 | 91 | 1433/1434هـ |
14086 | 85 | س/ |
15952 | 72 | 1434/1435هـ |
19281 | 56 | 1/ |
In the last subsection of this type, we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words; others are not. If words containing forbidden characters occur with more than a very low frequency, this may indicate a problem in preprocessing.
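The check described here can be sketched in a few lines of Python. This is only an illustration: the set of forbidden characters and the frequency threshold (`MAX_TOLERATED_FREQ`) are assumptions that would have to be chosen per language and corpus, and `suspicious_words` is a hypothetical helper name, not part of any existing tool.

```python
# Assumed for illustration: which characters are forbidden depends on the language.
FORBIDDEN_CHARS = set(",()%&$\"'+*=/_")
# Hypothetical threshold for what still counts as "very low frequency".
MAX_TOLERATED_FREQ = 5


def suspicious_words(entries, forbidden=FORBIDDEN_CHARS, max_freq=MAX_TOLERATED_FREQ):
    """Return (word, freq) pairs that contain a forbidden character and
    occur more than max_freq times, most frequent first."""
    hits = [
        (word, freq)
        for word, freq in entries
        if freq > max_freq and any(ch in forbidden for ch in word)
    ]
    return sorted(hits, key=lambda pair: pair[1], reverse=True)


# The first three entries use frequencies from the wordlist tables;
# the last entry is a made-up word without special characters.
sample = [("50%", 389), ("AT&T", 10), ("K&N", 1), ("word", 1000)]
flagged = suspicious_words(sample)
for word, freq in flagged:
    print(freq, word)
```

With these assumed settings, only the frequent entries containing a special character are reported; the single occurrence of K&N stays below the threshold and is ignored.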
Words containing %:
SELECT w_id - 100, freq, word FROM words WHERE w_id > 100 AND word LIKE "%\%%" LIMIT 10;
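The same query can be repeated for each special character in turn. The sketch below is a minimal illustration using Python's built-in `sqlite3` module and an in-memory table. The SQL dialect of the original query is not stated, so the explicit `ESCAPE` clause, the `ORDER BY`, the column types, and the helper names `like_pattern` and `top_words_containing` are assumptions. The sample rows reuse values from the tables, with `w_id` set to rank + 100 to mirror the offset in the query.

```python
import sqlite3

SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(char):
    """Build a LIKE pattern that matches any word containing `char` literally."""
    # % and _ are LIKE wildcards, so they must be escaped with a backslash.
    if char in ("%", "_", "\\"):
        char = "\\" + char
    return "%" + char + "%"


def top_words_containing(conn, char, limit=10):
    """Return (rank, freq, word) rows for words containing `char`."""
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (like_pattern(char), limit)).fetchall()


# Assumed table layout: only the three columns used by the query.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [(3960, "50%", 389), (5535, "10%", 271), (64037, "AT&T", 10)],
)

percent_rows = top_words_containing(conn, "%")
amp_rows = top_words_containing(conn, "&")
underscore_rows = top_words_containing(conn, "_")

for char in SPECIAL_CHARS:
    for rank, freq, word in top_words_containing(conn, char):
        print(char, rank, freq, word)
```

Passing the pattern as a bound parameter avoids quoting problems for the characters " and ', and escaping % and _ keeps them from acting as wildcards.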
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots